Rateless codes for near-perfect load balancing in distributed matrix-vector multiplication
نویسندگان
چکیده
Large-scale machine learning and data mining applications require computer systems to perform massive matrix-vector matrix-matrix multiplication operations that need be parallelized across multiple nodes. The presence of straggling nodes---computing nodes unpredictably slow down or fail---is a major bottleneck in such distributed computations. Ideal load balancing strategies dynamically allocate more tasks faster knowledge monitoring node speeds as well the ability quickly move data. Recently proposed fixed-rate erasure coding can handle unpredictable slowdown, but they ignore partial work done by nodes, thus resulting lot redundant computation. We propose rateless fountain strategy achieves best both worlds---we prove its latency is asymptotically equal ideal balancing, it performs zero Our idea create linear combinations m rows matrix assign these encoded different worker original product decoded soon slightly than row-vector products are collectively finished Evaluation on parallel computing yields much three times speedup over uncoded schemes.
منابع مشابه
Generating Vector Code for Matrix-matrix Multiplication
The current state of the art Matrix-Matrix-Multiplication (MMM) kernel is known as ATLAS, which generates the best performing MMM code by search. However, today’s computer architecture changes rapidly and it is hard to generate a high performance code without knowing how to use the new instruction sets. Since ATLAS does not make use of blocking for L2 cache, or SSE/SSE2 instruction, we are enco...
متن کاملDynamic Load Balancing for Distributed
In this paper we study diffusion schemes for dynamic load balancing on message passing multiprocessor networks. One of the main results concerns conditions under which these dynamic schemes converge and their rates of convergence for arbitrary topologies. These results use the eigenstructure of the iteration matrices that arise in dynamic load balancing. We completely analyze the hypercube netw...
متن کاملLoad Balancing in Distributed Systems
Distributed systems can be either connected computer or an array of single processor systems. The described method of speed performance, realized by load balancing procedures, can be used to both of these basic structures. By the application of genetic algorithms, using mutation as well as crossover, an optimal distribution mode is guaranteed.
متن کاملLoad Balancing In Distributed Computing
The paper presents what a Distributed system is. It also overviews the issues to be taken into consideration for developing the strategies for load balancing. Strategies for various types of Distributed systems are discussed. Each strategy considers the most economical transfer policy, selection policy, location policy, information policy. The paper concludes with comparison of various strategi...
متن کاملLoad Balancing in Distributed Systems
In Computer world Server load balancing is the process of distributing service requests across a group of servers. Users of one workstation are not hindered by the CPU intensive applications run on a different workstation. However, when a large number of machines in the network are idle, the efficiency of computation is decreased. The sharing of computing resources in a distributed system is a ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Communications of The ACM
سال: 2022
ISSN: ['1557-7317', '0001-0782']
DOI: https://doi.org/10.1145/3524298